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Abstract. Chebyshev observed in a letter to Fuss that there tends to be more prunes of 
the form 4n + 3 than of the form An + 1. The general phenomenon, which is referred to as 
Chebyshev's bias, is that primes tend to be biased in their distribution among the different 
residue classes modq. It is known that this phenomenon has a strong relation with the low- 
lying zeros of the associated L-functions, that is if these L-functions have zeros close to the 
real line, then it will result in a lower bias. According to this principle one might believe that 
the most biased prime number race we will ever find is the Li(a;) versus tt{x) race, since the 
Riemann zeta function is the L-function of rank one having the highest first zero. This race 
has density 0.99999973..., and we study the question of whether this is the highest possible 
density. We will show that it is not the case, in fact there exists prime number races whose 
density can be arbitrarily close to 1. An example of race whose density exceeds the above 
number is the race between quadratic residues and non-residues modulo 4849845, for which 
the density is 0.999999928... We also give fairly general criteria to decide whether a prime 
number race is highly biased or not. Our main result depends on the General Riemann 
Hypothesis and on a hypothesis on the multiplicity of the zeros of a certain Dedekind zeta 
function. We also derive more precise results under a linear independence hypothesis. 



1. Introduction and statement of results 

The study of prime number races started in 1853, when Chebyshev noted in a letter to 
Fuss that there seemed to be more primes of the form An + 3 than of the form An + 1. More 
precisely, Chebyshev claims without proof that as c — )■ 0, we have 



J2 (—] e-P" = 6-^^= - e-^' + e-^' + e 



P 



- _ . . . ^ oo. 



However, as Hardy and Littlewood |HL| and Landau |Lanlt[I7an2| have shown, this statement 
is equivalent to the Riemann hypothesis for L{s,X-4), where X-a denotes the primitive 
character modulo 4. 

The modern way to study this question is to look at the set of integers n for which 
7r(n;4,3) > 7r(n;4, 1), which we denote by -P4;3,i. One would like to understand the size of 
this set, however it is known that its natural density does not exist [K]. To remedy to this 
problem we define the logarithmic density of a set P C N by 

5(P) := lim V -, 

Af^oo log iV n 

^ n<N 
neP 

if the limit exists. In general we define S{P) and 5{P) to be the liminf and lim sup of this 
sequence. If P = P4;3,i, then this last limit exists under the assumption of the General- 
ized Riemann Hypothesis (GRH) and the Linear Independence Hypothesis (LI), and equals 
0.9959... (see [RS]). 



The General Riemann Hypothesis states that for every primitive character x mod g, 
all non-trivial zeros of x) li^ on the line 3f?(s) = |. 

The Linear Independence Hypothesis states that for every fixed modulus g, the set 

U {53(pJ : L(p^, x) = 0, < 5R(pJ < 1, 53(pJ > 0} 

X mod q 
X primitive 

is linearly independent over Q. 

For a good account of the history of the subject as well as recent developments, the reader 
is encouraged to consult the great expository paper [GMJ . 

Rubinstein and Sarnak developed a framework to study this question and more gen- 
eral "prime number races". Assuming GRH and LI, they have shown that for any r-tuple 
(ai,...ar) of admissible residue classes modg (that is (aj,g) = 1), the logarithmic density 
of the set Pq;ai,...,ar '■= • ^("-i Q') cti) > ^(2^; ?5 02) > ■ ■ ■ > 7r(x; q, flr)}, which we denote by 
6{q; Oi, . . . , Or), exists and is not equal to or 1 (we call this an r-way prime number race). 
Moreover, they have shown that if r is fixed, then as g — )■ 00, 

(5(g;ai,...,a/ 



max 

l<ai,...,ar<<!' 
(ai,q)=l 



0. 



In other words, the bias dissolves as g — )■ 00. For r = 2, this phenomenon can readily seen 
in |FiMa| . where the authors exhibit the list of the 117 densities which are greater than or 
equal to 9/10. By the trivial inequality 

P c P 

we see that the most biased r-way prime number race is the two-way race appearing on top 
of the list in |FiMa| . that is 

5(24; 5,1) = 0.999988... 

Only one race is known to be more biased: it is the race between Li(a;) and vr(a;), for which 
the density is 

5{l) ■= 5{{n : U{n) > 7i{n)}) = 0.99999973... 
One can also combine different residue classes modg to make prime number races. For 
two subsets A,Bg (Z/gZ)^, we consider the inequality 

^^7r(a;;g,a) > -^^7r(s;g,6), (1) 

' ' aeA ' ' beB 

and denote by 6{q; A, B) the logarithmic density of the set of x for which it is satisfied, if it 
exists. An example of such race was given by Rubinstein and Sarnak who studied the race 
between 

7r(a;; g, NR) = ^{p < x : p is not a quadratic residue mod g} 

and 

7i{x; g, R) = #{p < X : p is a. quadratic residue mod g}, 
for moduli g having a primitive root. This race appears naturally in their work, since as 
they have shown, it is the property of the competitors being a quadratic residue or not 
which determines whether a two-way prime number race is biased or not. These are good 
candidates for biased races, however it can be shown that as g — )■ 00, 6{q; NR, i?) — )■ ^ (but 
at a much slower rate than two-way races, see |FiMa| ). 
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In general, one can see ( |BFHR| . |FiMa] ) that low-lying zeros (excluding real zeros) have 
a significant effect on decreasing the bias. However, real zeros have the reverse effect, and 
increase the bias. Nonetheless, real zeros are very rare, in fact Chowla's conjecture asserts 
that Dirichlet L- functions never vanish in the interval s G (0, 1]. 

Odlyzko [O] has shown that the Dirichlet L-function having the highest first zero in the 
critical strip is the Riemann zeta function, which is Pq = ^ + i ■ 14.134725... Subsequently, 
Miller [MiJ generalized this result by showing that each member of a very large class of 
cuspidal GLn L-functions has the property of either having a zero in the interval [| — 
14.13472?, | + 14.13472z], or having a zero whose real part is strictly larger than 1/2 (violating 
GRH). In particular, this class contains all Dirichlet, rational elliptic curve and modular form 
L-functions, and possibly also contains all Artin and rational abelian variety L-functions. 
By these considerations, one might conjecture that the highest density one will ever find by 
doing prime number races is 5{1) = 0.99999973... 

As it turns out, this is false, and we can find races which are arbitrarily biased. This is 
achieved by considering races between linear combinations of prime counting functions, and 
we will see in Section [5] that the key to finding such biased races is to take a very large 
number of residue classes. 

The first (and most extreme) example we give is a quadratic residue versus quadratic 
non-residue race as in |RSj . but for a general modulus q. We take A = NR := {a mod q : 
a = □ mod g} and B = R := {6 mod g : b ^ □ mod g} in ([T]). Note that \B\ = 4'{q)/p{q) 



and \A\ = 0(g) (^1 - where 



p(g) := [G : G' 



'2^(9) if2tg, 

2^(9)-i if 2 I g but 4 t g, 

2^('') if 4 I g but 8 t g, 

^2^g)+i if8|g. 



and u{q) denotes the number of distinct prime factors of g. 

Theorem 1.1. Assume GRH and LI. Then for any e > there exists q such that 

l-e<6{q;NR,R) <1. (2) 
Moreover, for any fixed ^ < rj < 1 there exists a sequence of moduli {g„} such that 

lim 5(g„; NR, R) = r]. (3) 

n— >oo 

In concise form, 
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{5{q-NR,R)} 



To prove the existence of highly biased races we do not need the full strength of LI, in fact 
we only need a hypothesis on the multiplicity of the elements of the multiset of all non-trivial 
zeros of quadratic Dirichlet L-functions modulo g, which we will denote by Z{q). Note that 
LI implies that the elements of this set have multiplicity one. 

Theorem 1.2. Assume GRH, and assume that there exists an increasing sequence of moduli 
q such that logg = o{p{q)) and such that each element of Z{q) has multiplicity o{p{q)/ logg). 
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Then for any e > there exists q such that 

1 - e < NR, R) < S{q; NR, R) < 1. 



(4) 



Remark 1.3. The difference between ([2]) and (jl]) is explained by the fact that it is not 
known whether 6{q; NR, R) exists under GRH alone. 

Remark 1.4. For a fixed modulus q > 2, write q = 2^ Y\p\q P^^ ^'^d I := Jlpk P- ""^^ 
see that 

since there are no real primitive characters modulo p"^ with p ^ 2 and e > 2, and there are 
no real primitive characters modulo 2^ for e > 4 (see Lemma [STT]) . Therefore, when studying 
S{q; NR, R) one can assume without loss of generality that q is of the form 2™£, where i is 
an odd squarefree integer and m < 3. 

Remark 1.5. We will see that what controls the bias in these races is the number of prime 
factors of q and the size of q. More precisely, under GRH and LI the two following statements 
are equivalent: 

^logp = o(2-(^)), (5) 

p\q 

5{q;NR,R) = 1- o{l). (6) 
Using this, we can show that the set of moduli q < x such that 5{q; NR, R) = 1 — o(l) 
has density (logx)^'^^"^^'', where A = 1 — = 0.086071... Interestingly, Ford's work 

on integers having a divisor in a given interval (see |Fol] . [Fo2] ) shows that these integers 
appear in the Erdos multiplication table. 

In terms of random variables, this can be explained by saying that the extreme examples 
we are considering correspond to random variables whose mean is much larger than their 
standard deviation. The easy way to show that this implies a very large bias is to use Cheby- 
shev's inequality; however this approach is quite imprecise when the ratio E[X]/ ^/Vai\X] 
is large. Instead, one should study the large deviations of X — E[X]. The theory of large 
deviations of error terms arising from prime counting functions was initiated by Montgomery 
|Mo| . and has since then been developed by Monach [MnJ . Montgomery and Odlyzko [MoOdj . 
Rubinstein and Sarnak |RS] . and more recently Lamzouri [Lam]. Exploiting such ideas we 
are able to be more precise in ([2]). 

Theorem 1.6. Assume GRH and LI, and define q' := Y[p\qP- P{l) / ^^ZQ' ^■^ large enough, 
then we have 

exp (-a,p^\ < 1 - 5{q- NR, R) < exp {-a^^^^ , 

where ai and a2 are absolute constants. 

This last theorem shows that the convergence in ([2]) can be quite fast. It is actually 
possibly to explicitly compute a density which exceeds 5(1), namely 5(4849845; Xi?, i?) = 
0.999999928... Below we list the first few values of S{q; NR, R) for half-primorial moduli 
(that is, q is the product of the first k primes excluding p = 2). These values were computed 
using Mysercough's method |My| and Rubinstein's Icalc package. 
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uj{q) 


p(g)/log q' 


5{q- NR, R) 


3 


1 


1.82 


0.999063 


15 


2 


1.47 


0.999907 


105 


3 


1.71 


0.999928 


1155 


4 


2.26 


0.999877 


15015 


5 


3.33 


0.999950 


255255 


6 


5.14 


0.9999946 


4849845 


7 


8.31 


0.999999928 


111546435 


8 


13.81 


0.999999999954 



Remark 1.7. As remarked by Rubinstein and Sarnak |RS| . these densities can theoretically 
be computed to any given level of accuracy under GRH alone. Indeed, using the almost- 
periodicity of these races, this amounts to computing a finite number of zeros of Dirichlet 
L-functions to a certain level of accuracy. 

Remark 1.8. One can summarize Remark ll.5[ Theorem 11.11 and Theorem 11.61 by the fol- 
lowing statement: 

NR, R) ^ -j= / e-'^dx. 

V 27r i-^2"{9)-i/logg' 

Remark 1.9. Using our analysis, one can show that for almost all squarefree integers q, 

5(g;iVi?,i?)-l = (logg)^+''«. 
That is to say, races with normal moduli have a very moderate bias. 

It is possible to analyse highly biased races in a more general setting, and to determine 
which features are needed for this bias to appear. To do this we take "5" = (ai,...,afc) a 
vector of invertible reduced residues modulo q and 7^ = {ai,...,ak) a non-zero vector of 
real numbers such that ^j^^^ = 0. We will be interested in the race between positive and 
negative entries of it, that is we define 

6{q; it, 1^) := 6{{n : aiTc^n; q, ai) + ... + afc7r(n; q, at) > 0}). 

Moreover, we define 

{1 if Oj = □ mod q 
if ^ □ mod g, 

and we assume without loss of generality that 

k 

^ eifti < 0. 

i=l 

(By Lemma (5. 11 this will force S{q; "a^, "3") > |. If X]i=i ^i'^i — 0; then S{q; "a?", "ci") = |. If 
Yli=i ^i^i > 0; then we multiply 75" by minus one and study the complementary probability 
6{q; it , —it) = 1 — 5{q] it, it).) 

There are many choices of vectors it and it which yield highly biased races. We give 
some examples with constant coefficients, which we believe are the most natural. 
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Theorem 1.10. Assume GRH and LI, and let kn < ^||y and k^ < ^1 — 0(g) he 
two positive integers. Take ai,...,akj^ to be any distinct quadratic non-residues modg, with 
coefficients ai = ... = = k^, and ak^^+i, ....ak^+kp. to he any distinct quadratic residues 
modg, with a^^+i = ... = a^^+fc^ = —k^- There exists an absolute constant c > such that 
if for some < e < ^ we have 

kn kn 0(g) log g' 

then 

6{q; "S") > 1 — ce. 

Remark 1.11. Fix < e < ^ and define N^{q) to be to number of positive integers k^, k^ 



for which fcjv < (1 - 7^)0(9), < and 



kN kji 0(g) log g 

Then, for values of g for which p(g) > e~^logg, we have that N^{q) tends to infinity as 
g — )■ oo. Hence, for values of g for which logg = o(p(g)), ([7]) has a large number of solutions. 

Remark 1.12. Theorem 11.101 shows the existence of highly biased races with the same 
number of residue classes on each side of the inequality. Indeed, for moduli g with logg = 
o(p(g)), taking kfj = k^ with 0(g) log g/p(g)^ = o(A;/j) and choosing any residue classes 
ai, ak^+kn gives a race with 5(g; "cJ") = 1 — o(l). 

Remark 1.13. In Theorem I l.H we have kjy = ^1 — (j){q) and k^ = |||y, which explains 
why we obtained a highly biased race when p(g) was large compared to logg. 

Here is our most general class of highly biased races. 

Theorem 1.14. Assume GRH and LI. There exists an absolute constant c > such that if 
for some < e < we have 

Ell «f ^ ^ p{qf 



then 



Remark 1.15. Trivially, one has 



5(g; "a^, "3") > 1 — ce. 



where k^ := Ei=i ^j- Hence, for ([8]) to be satisfied, one needs k^ to be larger than 

_i0(g) logg 



P(g)^ 
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Since kji < this imposes the following condition on q: 

piq) > e^Mogg. 

Remark 1.16. The goal of Theorem 11.141 is to give a large class of biased races, without 
necessarily being precise on the value of 1^, it). One can use the Montgomery-Odlyzko 
bounds |MoOd] to obtain more precise estimates in some particular cases. 

The previous examples of highly biased races all have the property that the number of 
residue classes involved is very large in terms of q (it is at least q^^"'^^^). In the next theorem 
we show that this condition is necessary, and that moreover highly biased are very particular, 
in the sense that they must satisfy precise conditions. 

Theorem 1.17. Assume GRH and LI. There exists absolute positive constants Ki,K2 and 
< rj < 1/2 such that if k < Ki(f){q) and 
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^ ^^^ 0(g)log(30(g)A) ^ 



then 

6{q;~^,'^) <l-r]. (10) 
(Hence this race cannot be too biased.) 

Remark 1.18. Applying the Cauchy-Schwartz and using that kji := Yli=i^i — 4'{'i) / pi.Q) ^ 
one sees that if p(g) < log (30(g) /fc), then whatever and "c^ are, ([9]) holds. Moreover, 
in the range p{q) > K2\og{3(j){q)/k) we have that if kn < K2(t){q) / p{qf , then ^ holds. We 
conclude that a necessary condition to obtain a highly biased race is that k^ ^ 4'{(l) / pIq)"^ ■ 

An interesting feature of prime number races is Skewes' number. It is by definition the 
smallest xq for which 

7r(xo) > Li{xo). 

This number has been extensively studied since Skewes' 1933 paper in which he showed 
under GRH that 

xo<10i°"". 

The GRH assumption has since then be removed and the upper bound greatly reduced; we 
refer the reader to |BHj for the list of such improvements. The current record is due to Bays 
and Hudson |BH| . who showed that Xq < 1.3983 x 10'^^^, and moreover this bound is believed 
to be close to the true size of Xq. 

One could also study the generalized Skewes' number 

Xg.a,b ■= inf {x : tc{x; q, a) < n{x; q, b)}. 

However, two-way prime number races become less and less biased as q grows, that is 
S{q]a,b) — )■ I uniformly in a and b coprime to q. Hence, for large q we expect this gen- 
eralized Skewes number to be small and uninteresting. 

The situation is quite different with the highly biased we constructed, in fact we expect 
the Skewes number 



Xg := mi{x : (p(g) — l)vr(x; q, NR) < n{x; q, R)} 
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to tend to infinity as p{q)/ log q' tends to infinity {q' is the radical of q). One can then 
ask the following question: how fast does it tend to infinity? Similar arguments to those 
of Montgomery |Mo] and of Ng [Nj allow us to make the speculation that the answer is 
double-exponentially. 

Conjecture 1.19. As p{q)/\ogq' tends to infinity we have 

log log X -. 

log q' 
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2. Results without the linear independence hypothesis 

The goal of this section is to prove Theorem 11.21 (from which the first part of Theorem ll.il 
clearly follows). We first note that ii A = NR and B = R, then ([1]) is equivalent to 

7r(x; q, NR) > {p{q) - l)7r(x; g, R). 

Lemma 2.1. Assuming GRH, we have that 

7r(x; g, NR) - {p{q) - l)7r{x; q, R) i ^ sr^ x'^^ 

X mod g 7x 

x¥=xo 

Proof. Let b be an invertible reduced residue modg. We will use the orthogonality relation 



X mod q 
X^=XC 
Xt^XO 

The explicit formula gives 



1—1 it ^ U mod q. 



X mod q x mod q Px 



X =X0 X =X0 

Xt^XO Xt^XO 
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where runs over the non-trivial zeros of x)- The left hand side of (fT2|) is equal to 

X mod q p<x x mod ij p'^<x 

X^=XO X^=XO 
X¥=X0 Xt^XO 

= - 1) XI logP- 5Z logP+ (P(?) - l)v^ + o(v^), 

p<x P'^x 
p=n mod g P^fl mod g 

by fflT]) and the Prime Number Theorem. Combining this with a standard summation by 
parts we get that 

7t{x; q, NR) - (p(g) - l)7r(x; q, R) i , ^ x'^^ 

^ ' X mod q 7x '^'^ 

Xt^XO 

□ 

Lemma 2.2. T/ie quantity Eq{x) defined in Lemma \2. 1\ has a limiting logarithmic distri- 
bution, that is there exists a Borel measure fig on M such that for any bounded Lipschitz 
continuous function f : M. ^ M. we have 



hm i / fiEgiey))dy = [ f{t)dpg(t). 



Proof. This follows from Rubinstein and Sarnak's anaysis |RS] . and from |ANS] . □ 

Remark 2.3. As Schlage-Puchta has pointed out to me, it is possible to show under GRH 
that for all but a countable set of values of c, the density 

Fg{c) := lim —meas{y < Y : -Eg(e^) < c} 
y— >-oo Y 

exists. Moreover, one can show that in the domain where F is defined, 

supFg(x) < liminf — meas{y < Y : Egle^) < c} 
x<c y-)>oo Y 

< lim sup —meas{y < Y : -Eg(e^) < c} < inf Fg{x), 

Y->OD Y x>c 

and so in particular if Fg{x) is continuous at x = c, then the set {y < Y : Eqle^) < c} has a 
density. 

Let Xq be the random variable associated to fig. We will show that Xg can be very biased, 
in the sense that Prob[Xg > 0] can be very close to 1. To do so we will compute the first 
two moments of i?g(e^), which we relate to the random variable Xg. 

Lemma 2.4. We have that 

-y 



lim ^ [ Eg{ey)dy= [ tdfig{t), 



lim i r E,{eyfdy= I 



Proof. We will only prove the second statement, as the first follows along the same lines. 
Similarly as in |SP| . we can compute that 



1 

< 00, 



_L ^ _L n P1P2P3P4 

where the last sum runs over quadruples of non-trivial zeros of quadratic Dirichlet L-functions 
modulo q. This implies that as M — > 00, 

limsup^ / \EJey)\^dy — ^ 0. (13) 

Indeed, if this was not the case then we would have that for all M > Mi 



0; 



limsup— / \Ea(ey)\'^dy > n > 0, 



\Eq{ey)\>M 



and so 



limsup [ |E„(e^)|^rf?/ > r^M^ 



\E^{ey)\>M 

which would contradict the fact that the fourth moment is finite. We now define the bounded 
Lipschitz function 

't^ if |t| < M 

Huit) := {M'^{M + l-\t\) ifM<|t|<M+l 



if|t|>M + l. 



We then have 



\Eq{ey)\>M 



y[ E,{eyfdy=^! HM{E,{ey))dy - y f ^^^^^ HM{E,{ey))dy 

>^2<jy<y M<\Eg{ey)\<M+l 

therefore by f[T^ and by Lemma [2.21 we get that 

limsup:^ I Eg{ey)^dy = f HM{t)dfiq{t) + eM, 

where eM tends to zero as M — )■ 00. Using the bound 

fig{{-oo, -M] U [M, 00)) < exp{-C2\^) 
(see Theorem 1.2 of [RS]) we get by taking M ^ 00 that 

limsup;^ / Eg{eyfdy= [ t^dfig{t). 
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The same reasoning applies to the hminf, and thus the proof is finished. 

□ 



The following calculation is similar to that of Schlage-Puchta |SP| . who computed the 
moments of e~^^'^ip{e^; x)- 



Lemma 2.5. Assume GRH. Then, 



E[X,] = p{q) - 1 + z{q), Var[X,] = J2t 



2 



where the last sum runs over the imaginary parts of the non-trivial zeros of 

Z,is):= n Lis.x), 

X^=XO 

x¥=xo 

denotes the multiplicity of the zero | + ^7, the star meaning that we count the zeros 
without multiplicity, and z{q) denotes the multiplicity of the (possible) real zero 7 = 0. 

Proof. By Lemma [2. II we have that 

r E,{ey)dy = {p{q) - I + z{q)){Y ~ 2) + ^ ^ T e^7x.rf?/ + Oy^ool^-) 

Xt^XO 

= - 1 + z{q)){Y - 2) + (1) + Ov^ooiY), 
by absolute convergence. Taking Y ^ 00 and applying Lemma [2.41 gives that 

1 r"" 

E[Xg] = hm - / E,{ey)dy = p{q) - 1 + z{q). 

The calculation of the variance follows from Lemma 12.11 and from Parseval's identity for 
almost-periodic functions |B1] . (An alternative way to compute the variance is to argue 
as in fSPl.) □ 



Remark 2.6. It is a general fact that Besicovitch almost-periodic functions always have a 
mean value [B2J . Moreover, Parseval's identity [Bit iB2j shows that Besicovitch (and thus 
also Stepanov 5*^, Weil and Bohr) almost periodic functions f{y) have a second moment 
given by 



hm i r f{yfdy = Y^Al 



where the An are the Fourier coefficients of /. 



Lemma 2.7. Assume GRH. If 
is large enough, then 

%;iVi?,i?)>l-2^"^[^' 
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Proof. It is clear from Lemma [2.51 and the Riemann-von Mangoldt formula that Var[Xg] ^ 
logg', and therefore our assumption that B{q) is large enough implies that E[Xg] is also large 
enough, say at least 4. Define 

r , f if X < 

, if X < 

H(x):=l ^ f(x):=lx if < x < 1 

^ ^ 1 1 if a; > •'^ ^ I - 

^ - [l ifx>0. 

Clearly, f{x) is bounded Lipschitz continuous and f{x) < H{x). Therefore, 

6iq; NR, R) = liminf 1 /"^ H{E^{ey))dy > liminf i /"^ f{E,{ey))dy, 
which by Lemma [2.21 is equal to 

f{t)d^,it) = 1 - /(I - fit))d^,{t) 
Jr 

= 1-1 (1 - f{t))dfi,{t) > 1 - /i,(-oo, 1]. 

J —00 

We now apply Chebyshev's inequality: 

/Xg(-cx), 1] = Prob[Xg < 1] = Prob[X, - E[X,] < 1 - E[XJ] 

Var[X,] ^ Var[X,] 



< Prob[|X, - E[X,]\ > E[X,] - 1] < T^^nrf^ < 2 



(E[X,] - 1)2 - E[X,]2 
since E[Xg] > 4, and therefore 



6{q] NR, R)>l-2 



Var[X,] 
E[X,]2 ■ 

□ 



Proof of Theorem M.B. By Lemma [275| our hypothesis implies that for the sequence of moduli 
q under consideration. 



Var[Xg] < max(m^) V = [y^p{q) logg ) = o(p(g) 



2^ 



by the Riemann von-Mangoldt formula. Lemma 12.51 also implies that E[Xq] ^ and 
hence Lemma [2.71 implies that 

S{q;NR,R) > 1 - o(l). 

The last inequality to show, that is 5{q; NR, R) < 1, follows from an analysis using the 
functions f{x) and H{x) of Lemma 12. 7[ combined with a lower bound on 00, —1] 

similar to that in Theorem L2 of |RS| . which holds in greater generality |ANS| . 

□ 
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3. A CENTRAL LIMIT THEOREM 

The goal of this section is to show a central limit theorem under GRH and LI, from which 
the second part of Theorem 11.11 will follow. We first translate our problem to questions on 
sums of independent random variables, which can be done thanks to hypothesis LI. Recall 
that we are interested in the set of n such that 

n{n- q, NR) > - l)7r(n; q, R). 

Lemma 3.1. Assume GRH and LI. Then the logarithmic density of the set of n for which 
7r(n; q, NR) > {p{q) — l)TT{n; q, R) exists and equals 

Prob[Xg] > 0, 

where Xg is the random variable defined in Section\^ Moreover we have 

.Y,~p(4)-1+ Y. E^|=. (") 

X mod g 7x>0 y 4 + 7x 

where the Z^^ are independent identically distributed random variables following a uniform 
distribution on the unit circle in C. 

Proof. By Lemma [2. 1^ we have that 

7r(x;g,A^i?) - (p(g) - l)7r(x;g,i?) i , ^ x^^^ 

X mod g 7x 
X^=XO 
Xt^XO 

since LI implies that there are no real zeros. It follows by the work of Rubinstein and Sarnak 
that 6{q; NR, R) exists and equals Prob[Xq > 0] (their analysis shows that the distribution 
function of Xg is continuous). Moreover, an argument similar to the proof of Proposition 2.3 



of [FiMa] shows that holds. □ 

One can show that the random variables appearing in ( !T^ have variance Var[3ft(Z^^)] = i, 
and have mean IE[^'y^] = 0. Using this and the fact that they are mutually independent, we 
recover Lemma [2.51 

E[X,] = - 1, Var[Xj= E (15) 

X mod g 7x ^ ""^^ 
X^=XO 
Xt^XO 

since the zeros come in conjugate pairs (x is real). We will see in the following lemma that 
Var[Xg] X p[q) logg' (recall that q' := HpigP)' ^^"^ ^^^^ ^ crucial fact in our analysis. 

Lemma 3.2. Assume GRH and let Xg be the random variable defined in f[T^ . We have that 

'log log g'' 



Var[Xg] = 2'^(«)-^-^« log q' 



1 + 



log q' 



where tg = 1 if 2 \ q, and = otherwise. In particular, 

Var[Xg] X p(g)logg'. 
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Proof. By Remark II. 4[ we have that 

Var[X,] = Var[X2e,], 

where e < 3, 2^ || g and i := Hplg Therefore we assume from now on (without loss of 

generahty) that q = 2'^£, with e < 3 and i an odd squarefree integer. 
Lemma 3.5 of |FiMa] gives that 

E = ^ogq* - logTT - 7 - (1 + log 2 + 25J^(1, xl 

tr 4 + 7^ L (16) 

= logg* + 0(loglogg*), 
by Littlewood's GRH bound on ^(l,x). Plugging this into f|T5|) we get 

Var[X,] = J2 log 1* + C'(2"'(«) log log q). 

X mod q 

If g is odd, then there is exactly one primitive real character mode? for every d \ q, hence 
^ logg* = Y^logd = J2J2^ogp = ^logp2-(«)-i = 2-('^)-Mogg. 

X mod q d\q d\q p\d p\q 



If 2 II g, then there are no primitive characters modulo even divisors of g, so 

2' 



J2 logg* = ^logrf = 2-('')-2log^. 



X mod q d| | 

If 4 II g, then there is exactly one primitive real character modulo divisors which are a 
multiple of 4, so 

J2 log g* = 5^ log + 5^ log = 2-(«)~2 log(2g) . 

Xmodq c!|| i\d\q 

If 8 II g, then there are exactly two primitive real characters modulo divisors which are a 
multiple of 8, so 

J2 log g* = 5^ log d + 5^ log + 2 5^ log d = 2'^(«)-2 log(8g) . 

Xmodg d|| i\d\q 8\d\q 

X^=XO 8\d 

□ 

Let Xq be the random variable defined in ([Ej), and define 

v/Var[X,] 

It is B{q) which dictates the behaviour of the race we are considering: if -B(g) is small, then 
the race will not be very biased, whereas if -B(g) is large, then the race will have a significant 

14 



bias. By Lemma [3. 2 [ we have the estimate 

log log g' 



B{q) 



' 2i^('7) + l+f9 

log q' 



1 + i 2-'^('?) + 



log q' 



(17) 



To prove the second part of Theorem 1 1.1 1 we will need a sequence of moduli for which B{q) 
is very regular. 

Lemma 3.3. For any fixed < c < oo, there exists an increasing sequence of squarefree odd 
integers {g„} such that 

2'^('^")+^ = (c + o(l))logg„. 

Proof. Fix < c < oo, and define ■= min{e > 1 : 2~^c < and Ci := 2"^''c < 

Define for £ = 1, 2, ... the intervals 

le := (exp(cr'20,2exp(q'20), Je := (2exp(q ^20,4exp(cr'20). 
Since ci < we have that for all £ > 1, 

^ log 4 — ' 

4exp(cr^20 < exp(cr^2^+^); 

hence our intervals are all disjoint. We define pi to be any prime in the interval le, and 
similarly for p'^ E Je. The existence of such primes is granted by Bertrand's postulate (note 
that exp(c^^2^) > 4). Now, the sequence of moduli we are looking for is 

Qn ■■= n n 

l<l<ec l<i<n 

since 

2aj(g„)+l 2"+'^<:+l 2"+'^c+l 



loggn Oe(l) + Ei<^<„(cr'2^ + 0(1)) cr'2"+i + Oe(n) 
= 2-ci (l + 0,(^))=c(l + o(l)), 



by definition of Ci. 



□ 



Before proving the second part of Theorem II. H we give some information about the 
characteristic function of the random variables we are interested in. The following lemma 
implies a central limit theorem. 

Lemma 3.4. Let Xg be the random variable defined in (IT^ . and define 



9 

........ ^ . ^ 2 

X 

X"=XO 
Xt^Xo 



The characteristic function ofYq satisfies, for \^\ < | a/ Var[Xq] , 
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Moreover, in the same range we have 



Proof. The proof is very similar to that of Theorem 3.22 of |FiMa] . Using the additivity of 
the cumulant-generating function of Xq, one can show that 



iogx,(o = ^E[x,]e+ Yl Eiog ^o 



2^ 



X mod q 7x>0 
Xt^XO 



4 ^ 'X, 



(19) 



We will use the following Taylor expansion, which is valid for |^| < (see Section 2.2 of 



FlM^ ): 



logJo(0 = -j + 0(a. 



(20) 



Plugging this estimate into f lT^ we get that for |^| < 



X mod g 7t(.>0 4 
X^=XQ 

x¥=xo 



( 



e y y — ^ 

Xmodg7;(.>0 V4 ' / 



\ X =XQ 

\ x¥=xo 



\ 



J 



One can show using the Riemann-von Mangoldt formula that 

X mod (J7x>0 V4 ^ / 
X^=XO 

x¥=xo 

Moreover, by Lemma [3.21 we have Var[Xg] x p{q) log q'. Putting these together and using 
f fTSj) . we get that 



iogy,(0 = iogx. 



yVa^ 



- iE[Xg 



^/y^x'q 



p{q) log q' 



showing the first assertion. For the second we use the same argument, but we replace the 
estimate (120|) with the following inequality, valid in the range |^| < y: 

e 



logJo(0<-j- 



Lemma 3.5 (Berry-Essen inequality). Denote by Fg the distribution function of 

X, - E[Xg] 



□ 



1 
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and by F that of the Gaussian distribution. We have that 

sup \F,{x) - F{x)\ < -j^ -. 

xm p[q) log q' 



Remark 3.6. One could get a more precise estimate using the Mart in- Feuer verger formula 
|FeMaJ . However, the estimate of Lemma [3.51 is sufficient for our purposes. 

Proof. Since the statement is trivial if p{q) \ogq' is bounded, we can assume without loss of 
generality that Var[Xg] > 1 (by Lemma [3.21) . 

The Berry-Esseen inequality in the form given by Esseen (Theorem 2a of ^) gives that 
for any T > 0, 



sup \Fq{x) - F{x)\ < 



— 62 1 

c/f + -. 

i ^ T 



(21) 



We take T := Var[Xg]. By Lemma [3.4[ the part of the integral with |^| < |Var[Xq]4 is at 
most 



;Var[X,]3 



e 2 



P{9) log ij' 



1 



fVar[X,]5 



1 



p{q) log q' 



1 



p(g)logg' 



We now bound the remaining part of the integral using an argument analogous to Proposition 
2.14 of |FiMa] . Fix < A < |. By the properties of the Bessel function Jq{x), we have that 
if 1^1 > A, then whatever 7^ G M is. 



2^ 



i + ^2 




By ([IH]), this shows that in the range |^| > j^Yai[XqY^ we have |Xg(^)| < |Xg(^Var[Xg]~3 
(since Var[Xq] > 1), and so 



fVar[Xq]3<|5|<Var[Xq] 



^^(^) ^ -rfe«nf4var[X,]MlogVar[X,] + 



12 



e 2 



di 



C|>|Var[X,]5 



by ([T8|) . Applying Lemma |3.2[ we conclude that the right hand side of ([2T]) is at most a 
constant times (p(q') logg')~^. 

□ 

Proof of Theorem M.li second part. Fix rj G [|, 1]. We wish to find a sequence of moduli {g„} 
such that 6{qn, NR, R) — )■ 77. The case 77 = 1 was already covered in part (1), and the case 
?7 = I follows from taking prime values of q, by the central limit theorem of Rubinstein and 
Sarnak |RS| . Therefore we can assume that | < r/ < 1. 
Let K > be the unique real solution of the equation 

1 



'2ti 



e 2 dt 
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Let moreover {g„} be the sequence of squarefree odd integers coming from Lemma [3.31 for 
which 

2-('^")+i = logg;(«:2 + o(l)). 
By (fT7|) . this gives that as ri — )■ oo, 



Define 



B{qn) := -7=== ^■ 
v/Var[XgJ 



We will use the central limit theorem of Lemma 13.41 as well as the Berry-Essen inequality 
f l^T]) . Denoting by Fg^ the distribution function of Yg^ and by F that of the Gaussian 
distribution, we have that 

\Siqn, NR, R)-v\ = \Proh[Xg„ > 0] - v\ = |Prob[X,„ < 0] - (1 - 
= |F,„(-i?(g„))-F(-«:)| 

p(g„)logg; 

by Lemma 13.51 and by the fact that the probability density function of the Gaussian is 
bounded on M. Looking at the proof of Lemma [3.31 we see that p{qn) — > oo, hence this last 
quantity tends to zero as n — )■ oo, concluding the proof. □ 

4. A MORE PRECISE ESTIMATION OF THE BIAS USING THE THEORY OF LARGE 

DEVIATIONS 

To give a more precise estimate for the bias we are interested in under LI, we use the 
theory of large deviations of independent random variables. The fundamental estimate of 
this section is Theorem 2 of |MoOd] . 

Theorem 4.1 (Montgomery and Odlyzko). Forn = 1, 2, ... let Yn be independent real valued 
random variables such that E[y„] = and \Yn\ < 1. Suppose that there is a constant c > 
such that E[y^] > c for all n. Put Y = TnYn where X] < 
IfY.w.\>o.K\<V/2 then 



Prob[Y > V] < exp - ^ r^^ 



-1 



|r'n|<a 

IfT.w.\>.K\>W then 



Prob[Y >V]>ai exp ( - a^V'^ ( XI ) • 



Here ai > and a2 > depend only on c. 



To make use of these bounds we need to give estimates on sums over zeros. 
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Lemma 4.2. For T > 1 we have 



V J. = - log(g*v^) logT + 0(log(g*T)). 

^ /l I ^,2 TT 



hx\<T\/i + ll 



Proof. We start from the von Mangoldt formula: 

iV(T,x) = - log ^ + 0(logg*T). 
With a summation by parts we get 



E = 0(logg*)+ / 



, + / jdt + 0{\ogq 



^^^^rft + 0(log(g*r)) 
- log(g* VT) log T + O (log q*T) . 

TT 



Lemma 4.3. Let J^{q) he a subset of the invertible residues modg such that x ^ -^{(l) 
X £ ^{(l) ■ Define the random variable 



□ 



where the Z^^ are i.i.d. uniformly distributed on the unit circle. Then, we have for q large 
enough that 

aiexp (^-aj-^^ < Prob[Y > |^(g)|] < exp (^-aj-^^ , 
where the ai are absolute constants and 

' — mr~ - ~ 

Proof. It is a direct application of Theorem 14.11 Taking the sequence {rj} to be the —rj 



2 



ordered by size, we have for < a < 4 that 

19 



For the upper bound we take a = 4: then we trivially have ^|^^|>q, \rn\ < \J-'{q)\/2, so 
Prob[r > < exp [ -^1-^(^)1' I Ci Yl ^^S^* ) ) 



for some absolute constant Ci. For the lower bound we take a = 2/^1 + Tq^, where Tq > 1 
is a fixed large real number (independent of q and J^{q)) such that 

whose existence is granted by Lemma I4l2] (we grouped together conjugate characters). Then 
Theorem 14.11 gives the bound 

Prob[y > |^(g)|] > c^exp { -c,\J^{q)\' ( E E I ^ 



+ 7^ 
xeJ-(<?) 7x>ro 4 ' 

> C2exp I -C3|J'(g)|^ I C4 E ^^S?* 

for g large enough and some absolute constants C2,C3 and C4, since if we choose Ti > Tq 
independent of x and large enough such that N{2Ti, x) — N{Ti, x) ^ logg* (this is possible 
by the von-Mangoldt formula), then we have 

> E log^*- 

X6.F(g) 



□ 



Proof of Theorem Let be the random variable in (fl^ and define the symmetric 
random variable 

:=X,-E[X,]. 

By Lemma [3. II 

6{q;NR,R) = Prob[Xg > 0] 

= Prob[y, >-E[X,]] 

= Prob[rq < E[Xg]] = 1 - Prob[Fg > E[XJ]. 

The proof follows by taking J^{q) := {x mod q '■ X^ = Xoi X 7^ Xo] m Lemma 14.31 and by 
estimating L{q) as in the proof of Lemma [3.21 

□ 
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5. A MORE GENERAL ANALYSIS 

In this section we do a more general analysis by studying arbitrary linear combinations of 
prime counting functions. 

Throughout the section, = (ai,...,afc) will be a vector of invertible reduced residues 
modg and 7J" = (ai, ak) will be a non-zero vector of real numbers such that X]i=i — 0- 
Recall that 

1 if ttj = □ mod q 
if ttj ^ □ mod q 
and we assume without loss of generality that 

k 

^eiOi < 0. 

i=l 

To prove theorems I1.10[ 11.141 and I1.17[ we need a few lemmas. 
Lemma 5.1. Assume GRH and LI. Then the quantity 

E{y;q,a;d) := 0(g)- 



has the same distribution as that of the random variable 

^<?;a^,7^ := -p(g) X]^*"* + 5Z + ••• + "fcX(afc)| I ' (22) 

i=l XT^XO 7x>0 \ \ 



where the Z^^ are independent random variables following a uniform distribution on the unit 
circle in C. 

Remark 5.2. If we take ai, ci0(g)(i_p(q)-i) to be the set of all quadratic non-residues mod q 
with ai = ... = a</,(g)(i-p(g)-i) = and we take a<^(g)(i_p(,)-i)+i, a^(g) to be the set of 

all quadratic residues modg with Q;^(q)(i_p(5)-i)+i = ... = = ^^^^^ then we recover the 
formula dH]). 



Proof. In the same way as in the proof of Lemma 13. 1[ we get by the explicit formula and by 
applying GRH that 



, aiip{ey;q,ai) + ... + akip{ey;q,ak) 



F{y;q, a , a) : = (j){q)- 



= - X] + ••• + «fcX(afc)) h Oq(l), 

Xt^XO 7x 

(the main terms cancel since XliLi = 0)- -^Y ^^e work of Rubinstein and Sarnak |RS) . 
F{y; q, "cJ") has the same distribution as X^.-^ -^ — K[Xg.-^^-^], since LI implies that there 
are no real zeros. The second step is to use summation by parts and to remove squares and 
other prime powers; this gives that 

k 

E{y; q, "^) + p{q) ^ e^ai + o(l) = F{y; q, "S"), 

1=1 
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completing the proof. □ 

Before we give a bound on the variance of this distribution, we prove a lemma about 
conductors. 

Lemma 5.3. Let 1 < L < (f){q). Then, 

mod q '■ q* < L} < min{Lr(g), L^}. 
Proof. Denoting by (j)*{d) the number of primitive characters modg, we have 

J]0*(d)<min|^rf,Lj]l 

d\q I d<L d\q 

d<L 

□ 

Lemma 5.4. Assume LI. Let V{q]~(^,7^) := Var[Xg.-^^-^], where X^.-^-^ is the random 
variable defined in (122|) . Then, 

0(g)||^||^log(^^^ «y(g;^,7^)«0(g)||^||^logg, (23) 

where 

k 



i=l 



« 112 



Remark 5.5. The upper bound in (123|) is attained when q is prime by Lemma 15. 8[ As 
for the lower bound, if we take moduli q with a bounded number of distinct prime factors 
and consider the race between residues and non-residues with the weights of Remark 15. 2| we 
obtain by Lemma [3.21 that V{q; "o^, "3") = 0(1), and this is of the same order of magnitude 
as the lower bound in (l23l). 



Proof. Since the Z^^ in ( l22l) are independent and have variance equal to |, we have that 
Var[Xg.^_^] = \aix{ai) + ... +akx{ak)\^Y I l_ ^ - 

Xt^XO 7x 4 + ^^X 

(LI implies that there are no real zeros.) By ( fT6l) . there exists go such that whenever q* > go 
we have 

Er^^l^g^*' (24) 

7x ^^^^ 

and the same estimate clearly holds for g* < go, since the left hand side of (p4|) is positive. 
We conclude that 

y(g; 0^, "S')x |aix(ai) + ... + afcX(afc)riogg*. (25) 

Xt^XO 
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Now, + ••• + afeX(«fc) = 0, so 



^ |aix(ai) + ... + afcx(afc)P = ^ |aix(ai) + ••• + afcX(afc)P 

Xt^XO X mod q 



^^ij^k X mod 9 

k 



0(9) E 



Using this and (125 p . the upper bound follows from the fact that logg* < logg. This also 
gives the lower bound V{q;~ct ,7t) > log30(g)||"c^||2, which proves the claim for bounded 
values of (j){q)/k. Hence we assume from now on that (f){q)/k > 576. We fix a parameter 
1 < L < (j){q) and discard the characters of conductor at most L: 

V^(g; 0^, 7^) > logL ^ |aix(ai) + ... + 

X mod q: 

q*>L 

X mod q: 
q*>L 



logL 



i=l X mod g ISJ^T^i^fc 
g*>L 



X mod g: 
q*>L 



which by Lemma 15.31 and the orthogonality relations is 

" k 

Y(^1i.<t)i.(l) -'^'^'^{Lr{q),L'^}) - laja^ l min{Lr(g), L^} 

l<if^j<k 

- {k + l)mm{LT{q),L^}] 



> logL 



i=l 

>logL||^||2 



by the Cauchy-Schwartz inequality. Taking L := [3(f){q)/k)3 gives the result, since then 
(p{q)/k > 576 implies that {k + 1)1"^ < 4>{q)/2. 

□ 

Remark 5.6. In the last proof, we did not lose a lot by discarding the characters of con- 
ductor at most (30(g) //c) 3, since by Lemma [5.31 and the Cauchy-Schwartz inequality, their 
contribution is 

p0(g)\ ^ 



«0(g)||^||^lo 



\ k J 



Proof of Theorem \1.14\ We have by Lemma [5.41 that there exists an absolute constant c > 
such that 



A/Var[Xg.-^_^; 
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C(f){q) logg^J^^a^ 



a quantity which is greater or equal to (ce)~5 by the condition of the theorem. We conclude 
that 1 — 5{q] "cJ") < ce by using Chebyshev's bound in the same way as in the proof of 
Theorem 11.11 

□ 



Proof of Theorem \1.1(A It is a particular case of Theorem 11.141 



□ 



We now prove our negative results. To do so, we need to provide a central limit theorem, 
analogous to Lemma | 



Lemma 5.7. Let 



X. 



q; a ,a 



q; a ,a i 



q; a ,a • 



The characteristic function of Y„.-^ satisfies 



yq;7^A0 



in the range \^\ < -^wmr, wh 



+ 



ere 11 73" I 



log(30(g)//c) 

i=l l«i 



. ■ . fc^ log q \ 

'0(g)log(30(g)A)/ 



Proof. As in Lemma [3.41 we compute 



logX,^^,^(0 = 2E[X,^^,^]e + 5^ 5^ log I Jo 



2|aiX(«i) + ••• + afcX(«fc)|^ 



We now use the Taylor expansion (120|) . which is valid as soon as |^| < 
this condition we have 



since under 



2|aix(ai) + ■■ • + akX{ak)\\^\ ^ 2||"^||i 
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1 + ^2 1/2 5|^|i 5 

Using ( l24l) and the analogous estimate "^^^{^ + 7^)"^ ^ log 5'*, we get 



logV;.i?,7^(0 



^ _^ ^ I ^4 ^x^xo + - + afcX(afc) 1^ log q* 



\ 



Exy^xo + ••• + afcX(afc)plogg*j J 



(26) 



2\2 



If (j){q) /kis bounded, then the statement trivially follows from the bound ^ . af < a 
Therefore we assume from now on that (f){q)/k > 576. 

We now use two different approaches to bound the error term. The first idea is to "factor 
out i/log q* " before applying the trivial inequality Yli^^t — iYli^i)"^- We have seen in 
Remark 15.61 that the main contribution to the variance is that of the characters with q* > 
L ■= {3(j){q)/k)3. We use the same idea here. Setting := |aix(ai) + ... + afcx('^fc)P) 

24 



have 



Xt^XO 



Xt^XO 
q*>L 



Xt^XO 



Vxt^XO / 



(27) 



by Lemma and the Cauchy-Schwartz inequahty. Now, by our choice of L and by the fact 
that (f){q)/k > 576 we have 



0(g)||^||^-A;L2||^ii2 



Xt^XO 
q*>L 



Xt^XO 



hence (l27j) gives that 



Xt^XO Xt^XO 



Plugging this into and using the trivial bound ^^^^^ logg* < [Yl,^^^^ Q^\/\og q* 

we get that the error term is <^ logL. 

For the second upper bound we use Lemma [5.41 and the Cauchy-Schwartz inequality: 



(Ex^xo + ... + afcX(afc)nogg* 



log(30(g)/fc)2 (0(g)||7?||2)2 

logo' aiO^/ =aj a laod q 

log(30(g)/A;)2 



2 



^ logg 



log(30(g)/A:)2 <l>iq)\\^\\l ' 

which gives the claimed bound. □ 

Proof of Theorem \l.l'7[ Let K > 1 and define c > to be the constant implied in the lower 
bound in Lemma 15.41 Assume that k < 0(g) and that ([9]) holds with K2 = K. Define 
the vector := ip^j^TJ", so that ||^||i = e~^, which will allow us to apply Lemma 15.71 
Clearly, 

(5(g;"^,'a") = 6{q;~^,^), 

since multiplying 7^ by a positive constant does not affect the inequality Q;i7r(n; g, ai) + ... + 
afc7r(n; g, a^) > 0. 
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We have by Lemma 15.41 and by the definition of c that 



_ 1 
C 2 



log(30(g)A)EtiA^ 



0(g) log (30(g) A) Ell 



a quantity which is at most vi^ by ([9]). Defining 



q; a , 



Var[X 



we have by Lemma [5.71 and by our condition on k that in the range |,^| < |e 



3„K 



+ 



Combining this with the Berry-Esseen inequahty (12T]) and taking to be a standard Gauss- 
ian random variable with mean and variance 1 we get 

Prob[F -^^^ > -B{q- ^, ^)] - Prob[l^ > -5(g; ^, ^)] 



•C 



< e 



5^ 
3oK 



e 2 c, 



(28) 



e 



4/^ 



-K 



However, since -B(g; "5", ^) < V^, we have that 

Prob[Vr < -5(g;^,^)] > Ci 



e 2 



for some absolute constant Ci. Therefore, applying (p8|) gives 

5{q- ^, ^) = Prob[F -^^^ > -B{q- ^, ^)] 

= Prob[Vr > -B{q- ^, ^)] + 0(e-^) 
< 1 -cie-f /ir + cae"^, 

a quantity which is less than the right hand side of (fTOl) for K large enough. The proof is 
finished since 5(g; "o^, "ci") = 5(g; 7?", p ). □ 

To end this section we give an exact expression for the variance l^(g; "3"). While we 
have not explicitly made use of this expression, we include it for its intrinsic interest, and 
for its ability to give a precise evaluation of the variance V{q]~(^ , 7^) for values of g having 
prescribed prime factors. 
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Lemma 5.8. We have that 



A 



V^(g;^,7t) = 0(g)||7^||2 [logg- y''"'"^ ''^^ . (29) 

Proof. Using Proposition 3.3 of |FiMa] . we get 

V{q]~^,~^)= ^ \aix{,ai) + ...+ akX{.ak)? log q* 

X mod q 

^^hj^k X mod 9 



A 



(q,aia. ^ -1) 



□ 



It might seem like the second term of (129|) is an error term, however this is not necessarily 
true for large values of k (see Lemma [3.2p . Nevertheless, we expect many cancellations to 
occur since 

(fc \ 2 k 
i=l / i=l i=l 
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